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Organizational performance is one of the most important constructs in management research. 
Reviewing past studies reveals a multidimensional conceptualization of organizational performance 
related predominately to stakeholders, heterogeneous product market circumstances, and time. A 
review of the operationalization of performance highlights the limited effectiveness of commonly 
accepted measurement practices in tapping this multidimensionality. Addressing these findings 
requires researchers to (a) possess a strong theoretical rationale on the nature of performance (i.e., 
theory establishing which measures are appropriate to the research context) and (b) rely on strong 
theory as to the nature of measures (i.e., theory establishing which measures should be combined and 
the method for doing so). All management research on performance should explicitly address these 
two requirements. The authors conclude with a call for research that examines triangulation using 
multiple measures, longitudinal data and alternative methodological formulations as methods of 
appropriately aligning research contexts with the measurement of organizational performance. 
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Organizational performance is the ultimate dependent variable of interest for researchers con- 
cermed with just about any area of management. Market competition for customers, inputs, and 
capital make organizational performance essential to the survival and success of the modern busi- 
ness. As a consequence, this construct has acquired a central role as the deemed goal of modern 
industrial activity. Marketing, operations, human resources (HR), and strategy are all ultimately 
judged by their contribution to organizational performance. Measuring it is essential in allowing 
researchers and managers to evaluate the specific actions of firms and managers, where firms 
stand vis-a-vis their rivals, and how firms evolve and perform over time. Its importance as the 
ultimate evaluative criterion is reflected in its pervasive use as a dependent variable. March and 
Sutton (1997) found that of 439 articles in the Strategic Management Journal, the Academy of 
Management Journal, and Administrative Science Quarterly over a 3-year period, 23% included 
some measure of performance as a dependent variable.' In contrast to the dominant role that 
organizational performance plays in management fields is the limited attention paid by research- 
ers to what performance is and how it is measured. 

The definition of organizational performance is a surprisingly open question with few 
studies using consistent definitions and measures (see Kirby, 2005). Performance is so com- 
mon in management research that its structure and definition are rarely explicitly justified; 
instead, its appropriateness, in no matter what form, is unquestionably assumed (March & 
Sutton, 1997). It is sad that, 10 years after March and Sutton’s call to virtue, these assump- 
tions continue largely unquestioned. Reviewing the last 3 years of the journals examined by 
March and Sutton as well as the Journal of International Business Studies and the Journal 
of Management, we identified 213 papers—29% of the total published in those journals— 
that included organizational performance as a dependent, independent, or control variable 
(see the appendix). Table 1 summarizes the measurement of performance used in these 
papers. The measures ranged from an assortment of financial operating ratios (e.g., net profit 
after taxes [NPAT] and the return on equity [ROE]) to measures of successful outcomes (e.g., 
Food and Drug Administration [FDA] approvals and survivorship) to broad subjective per- 
ceptions of relative performance variously measured. Overall, across the 213 papers identi- 
fied as including a performance variable, 207 different measures of performance were used. 
The diversity of approaches was further complicated by variation in the use of single, mul- 
tiple, and aggregated measures. What stands out from this examination is the lack of clarity 
in the theoretical definition of performance and the absence of methodological consistency 
in the formulation of the construct(s) used. Together, these limitations make effective posi- 
tive scientific comparisons between these papers difficult and normative recommendations 
hard to justify as having any degree of prescriptive validity. 

Our review indicates that despite its recognized importance, researchers still pay little 
theoretical attention to, or display methodological rigor about, the choice, construction, 
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Table 1 
Measurement of performance 2005-2007 in leading 
academic management journals* 


AMJ ASQ JIBS JOM SMI Total 
Articles in period 188 49 157 120 208 722 
Articles using a performance variable 45 9 39 22 98 213 
Percentage using performance as a:° 
Dependent variable 73% 67% 62% 55% 72% 69% 
Independent variable 9% 44% 18% 23% 21% 19% 
Control variable 36% 11% 38% 55% 33% 36% 
Source of data 
Survey only 22% 11% 33% 27% 19% 23% 
Secondary only 53% 78% 54% 64% 60% 59% 
Both 24% 11% 13% 9% 20% 18% 
Operationalization” 
Objective 16% 89% 62% 59% 719% 73% 
Subjective 22% 11% 36% 36% 22% 26% 
Quasi-objective 9% 0% 10% 5% 5% 7% 
Performance measures used? 
Accounting (objective) 47% 33% 49% 50% 60% 53% 
Accounting (subjective) 1% 0% 18% 18% 11% 12% 
Financial market (objective) 18% 22% 15% 14% 18% 17% 
Financial market (subjective) 2% 0% 0% 0% 0% 0% 
Tobin’s g or other mixed measure 4% 11% 5% 9% 17% 11% 
Reputation or perception of effectiveness 11% 11% 23% 41% 16% 19% 
(subjective) 
Sales, market share, or related measure 24% 22% 15% 18% 9% 15% 
(objective) 
Sales, market share, or related measure 2% 0% 26% 14% 10% 11% 
(subjective) 
Survival 4% 22% 8% 9% 8% 8% 
Number of measures 
Percentage using a single performance 60% 44% 33% 45% 52% 49% 
measure 
Percentage where multiple measures 40% 56% 67% 55% 48% 51% 
were collected 
Multiple measures used in: 
Multiple separate analyses 56% 80% 54% 58% 719% 67% 
Aggregation (factor analysis) 17% 20% 38% 25% 21% 25% 
Aggregation (averaging) 28% 20% 23% 33% 19% 23% 
Percentage modeling longitudinal data 53% 67% 36% 32% 52% 48% 


a. Academy of Management Journal (AMJ), Administrative Science Quarterly (ASQ), Journal of International 
Business Studies (JIBS), Journal of Management (JOM), and Strategic Management Journal (SMJ). 
b. Percentages may add up to > 100% due to multiple categorizations per article. 
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and use of the plethora of performance measures available to them. Our goal is to 
reignite the debate surrounding the meaning and measurement of organizational perfor- 
mance with particular emphasis on financial measures and the use of performance as a 
dependent variable. 

This debate is critical, first, to update research thinking. Although several foundational reviews 
on measuring performance were published in the 1980s (e.g., Chakravarthy, 1986; Venkatraman 
& Ramanujam, 1986), the past 20 years have seen the advent of a greater number of performance 
measures and with a broader ambit (e.g., the triple bottom line) and important empirical and 
methodological developments that have yet to be comprehensively integrated. 

The broadening of the domain of organizational performance has implications for both 
methodological practice and the relationship between academic research and management 
practice. Stated simply, we may not be measuring the performance to which managers are 
managing. For example, Boyd, Gove, and Hitt (2005) found that in the papers published by 
the four leading management journals from 1998 to 2000, performance was the most com- 
mon dependent and in 38.1% of the cases it was measured using a single indicator. Only 
19.6% of these studies used statistically constructed scales that allowed the measurement 
structure and error to be evaluated (Boyd et al., 2005). We find similar results with single 
indicators used in 49% of the papers we reviewed. The frequent use of single item measures 
indicates a continuing need to integrate methodological developments into practice. 

Second, the debate is critical to establish the relevance and applicability of academic 
management research. The inability to understand and characterize performance consistently 
reduces the effect and relevance of management research. Inconsistency leads to a diver- 
gence in results, undermining the clarity of the findings and the confidence of researchers 
and managers alike. With a more coherent understanding of the meaning and measurement 
of performance, we hope to make a small step toward convergence in understanding the most 
common dependent variable of interest in our research. 

The structure of the remainder of this review is as follows. We begin by reviewing the domain 
of organizational performance. This domain stretches across a diversity of users and uses that 
make the construct of organizational performance both complex and multidimensional. We then 
consider the literature on the measures available. Uniting these literatures provides guidance on 
methodological best practice in measuring performance. An important agenda for further research 
that seeks to validate these measurement approaches is then suggested. 


Review of the Domains of Organizational Performance 


In examining the construct of organizational performance, we start by defining organiza- 
tional performance. This requires the examination of the closely related construct of organi- 
zational effectiveness. We then examine the dimensionality of the construct of organizational 
performance in detail. This dimensionality arises out of the stakeholders that interact with 
and within an organization, the heterogeneity of organizational resources, environments and 
strategic choices, and the variation of performance over time. 
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Organizational Performance Versus Organizational Effectiveness 


Although organizational performance dominates the strategic management literature, not 
to mention economics, finance, and accounting, it is not unchallenged. Performance is one 
type of effectiveness indicator, with both advantages and disadvantages. Hence, we first need 
to distinguish between organizational performance and the more general construct of orga- 
nizational effectiveness (Venkatraman & Ramanujam, 1986). Organizational effectiveness is 
a broader construct that captures organizational performance, but with grounding in organi- 
zational theory that entertains alternate performance goals (Cameron & Whetten, 1983). 
Management research in general, and strategic management research more specifically, has 
taken a more limited empirical view, emphasizing the central role of accounting, financial, 
and stock market outcomes. To simplify this discussion and remain consistent with the usage 
in the literature, we will distinguish between organizational effectiveness and organizational 
performance. 


Organizational performance encompasses three specific areas of firm outcomes: (a) financial 
performance (profits, return on assets, return on investment, etc.); (b) product market perfor- 
mance (sales, market share, etc.); and (c) shareholder return (total shareholder return, economic 
value added, etc.). 

Organizational effectiveness is broader and captures organizational performance plus the plethora 
of internal performance outcomes normally associated with more efficient or effective opera- 
tions and other external measures that relate to considerations that are broader than those simply 
associated with economic valuation (either by shareholders, managers, or customers), such as 
corporate social responsibility. 


Although innovation and efficiency measures are generally placed into the wider conceptual 
domain of organizational effectiveness (Cameron & Whetten, 1983; Venkatraman & 
Ramanujam, 1986), other management researchers have taken these same variables as their 
dependent performance measure (e.g., Capon, Farley, & Hoenig, 1990; Hall, Jaffe, & 
Trajtenberg, 2005).* The implementation of balanced scorecards has also increased the atten- 
tion given to wider aspects of organizational effectiveness in management research. Although 
primarily used for internal management and control, balanced scorecards explicitly include 
measures of financial performance, customer outcomes, innovation, and internal processes 
(Kaplan & Norton, 1996). Balanced scorecards are closely tailored to each individual firm. 
This tailoring makes comparing balanced scorecard results across firms almost impossible, 
given that the implementation of a balanced scorecard for a single firm is already complex 
and difficult (Neely & Bourne, 2000; Schneiderman, 1999). It is for this reason that organi- 
zational performance dominates organizational effectiveness for management researchers. 
The narrower domain of organizational performance provides the useful potential to make 
meaningful comparisons across firms and industries. 

What is evident, however, is that even with a narrower domain, organizational perfor- 
mance is not a one-dimensional theoretical construct nor is it likely to be characterizable 
with a single operational measure. Although the multidimensionality of performance is rec- 
ognized in accounting (e.g., Callen, 1991) and finance (e.g., Henri, 2004) and discussed 
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theoretically in the management literature (Venkatraman & Ramanujam, 1986), empirically, 
the lack of consistency in the measurement of organizational performance in management 
research has revealed a surprising lack of researchers “walking the walk.” 

It follows that before we can discuss the importance of operational measurement issues, 
it is critical that we examine the sources of the multidimensionality of organizational perfor- 
mance in detail. Our review suggests that dimensionality is related to three sources within 
the research context: (a) Who are the stakeholders for whom a performance measure is rel- 
evant? (b) What is the Jandscape over which performance is being determined? and (c) What 
timeframe is relevant in measuring performance? Validly measuring performance requires 
allowing for these sources of multidimensionality. 


Source of Dimensionality 1: The Stakeholders 


Few would argue with the contention that all stakeholders have some potential claim to 
the rents arising from the activities of the firm (Freeman, 1984). Viewed broadly, stakehold- 
ers include anyone who “can affect or is affected by the achievement of the organization’s 
objectives” (Freeman, 1984). They can have different motivations that imply different mea- 
surement needs; for example, managers, employees, suppliers, customers, stockholders, 
governments, and nongovernmental organizations (NGOs) would all concentrate on mea- 
sures of performance most directly related to their own goals (Fitzgerald & Storbeck, 2003; 
Hillman & Keim, 2001). Independent of whether one views the normative or descriptive 
view of stakeholder theory as relevant, or whether potential claims to organizational rents are 
legitimate, there is little doubt that, in reality, an organization’s management engages in an 
active tradeoff of stakeholder interests (Blattberg, 2000; Mitchell, Agle, & Wood, 1997). 
This can be seen both at the organizational level and the level of the country and has implica- 
tions for the applicability of measures of performance. 

The “Anglo-American” model of the United States and the United Kingdom gives pri- 
macy to the interests of shareholders (Dore, 2000). If shareholders were the sole stakeholders 
to be considered, then maximization of some measure of shareholder return can be justified 
as the sole criterion of performance. However, in practice, other stakeholders such as the 
firm’s managers and suppliers must be considered. This is particularly so for firms from 
Continental Europe and Japan where more attention is placed on the interests of broader 
stakeholders (Dore, 2000). Adding in other stakeholders—such as employees, partners, 
NGOs, and society at large—increases the dimensionality of performance to include items 
such as employment conditions and environmental sustainability, bringing it closer to that of 
organizational effectiveness. In this context, the selection of a single measure may bias mea- 
surement by ignoring the distribution of value created across stakeholder groups. 

Operationally, management research has adopted a much narrower definition of stake- 
holders that focuses on a firm’s economic interests and parties that bear risks in relation to 
them in order to narrowly define legitimate claims (e.g., Clarkson, 1995). This is reflected 
in the popularity of financial and accounting measures, which are closely associated with 
managers and shareholders, two of the highest legitimacy stakeholder categories (Mitchell 
et al., 1997).* A popular distinction is between primary stakeholders like suppliers and 
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customers who have a direct exchange relationship with the firm and secondary stakeholders 
who do not (Clarkson, 1995). Relationships with primary stakeholders are characterized by 
a degree of mutual dependence, whereas secondary stakeholders have little effect on the firm 
(Casciaro & Piskorski, 2005). Analyses of firm social performance indicate that meeting the 
wider interests of stakeholders with a close connection to the firm is associated with higher per- 
formance as measured by return on assets (ROA) and earnings-per-share (EPS). In contrast, 
higher performance on dimensions of interest to secondary stakeholders has not translated into 
higher financial performance (Van der Laan, Van Ees, & Van Witteloostuijn, 2008). 

The nature of these measures can also be firm specific, depending on internal policies. 
The selection of remuneration policies will affect the relevance of particular measures for 
staff groups. For instance, cash flows, accounting numbers, and stock prices produce differ- 
ent incentives for managers (Dutta & Reichelstein, 2005). More broadly, management strat- 
egy also leads to different organizations placing emphasis on different aspects of their 
performance. For instance, organizations that rely heavily on debt financing would be 
expected to manage investments more in line with the demands of these stakeholders and 
hence engage in less risky investment activity (Devinney & Milde, 1990). 


Implication 1: Measuring performance requires weighing the relevance of performance to focal 
stakeholders. 


Source of Dimensionality 2: Heterogeneity—Resources, 
the Environment, and Strategic Choice 


Organizations are heterogeneous in their resources and capabilities and how and 
where they choose to use them (Barney, 1991). At the most basic level, small and large 
firms are likely to perform in quite different manners. Although linked by competition, 
these firms have very different resources and strategies. This is evident in how firms 
measure themselves. Large organizations use both financial and nonfinancial perfor- 
mance measures but favor financial measures (Malina & Selto, 2004). Small firms also 
use both financial and nonfinancial variables to measure their performance. In a cross- 
country survey, Laitinen and Chong (2006) found that small Finnish companies focused 
on profitability, product margins, customer satisfaction, and liquidity. Small U.K. com- 
panies were similar, giving less emphasis to overall profitability but also weighing debt 
levels highly. This evidence supported earlier findings by Davig and colleagues (Davig, 
Elbert, & Brown, 2004) that product performance is more prominent in the evaluation 
of performance for small firms. 

Industry and environmental context frames resources and strategies, moderating perfor- 
mance outcomes (Hawawini, Subramanian, & Verdin, 2003; McGahan & Porter, 1997; 
Rumelt, 1991). For example, McGahan (2004) identifies four trajectories of industry change: 
progressive, radical, intermediating, and creative. The appropriateness of the strategic 
choices firms can make depends critically on the trajectory of an industry, the implication 
being that the performance of sets of resources and assets will depend predominantly on the 
trajectory in which they are applied (McGahan, 2004). McGahan likens these trajectories to 
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highways, which are bounded by largely immovable barriers that keep traffic on the road. As 
a consequence, they strictly define the set of performance outcomes that are potentially 
available to the firms situated within their constraints trajectory. Organizational responses to 
environmental factors also differ (Miles & Snow, 1978). This heterogeneity generates mul- 
tiple paths to performance that are influenced by a broad variety of factors (Hofer, 1983; 
Lenz, 1981). The validity of comparisons will be highest where firms are on the same path. 
A survey of executives of 321 business units found that performance comparisons within 
strategic groups were the most valid (Davis & Pett, 2002). 

Performance itself is likely to be somewhat firm specific, as the strategic choices 
a firm makes will dictate which performance measures will reflect the latent perfor- 
mance construct (Steers, 1975). Understanding how different independent variables 
link to a dependent performance variable is then no longer trivial (March & Sutton, 
1997). Assuming away this dimensionality will lead to misdirected or biased measure- 
ment. From a measurement perspective, it is unlikely that changing strategies leaves the 
dimensionality of the performance indicators unchanged. Because different strategies 
relate to different dimensions of performance, so they also alter the way these perfor- 
mance dimensions load onto the latent construct. For instance, Fryxell and Barton 
(1990) found that the measurement structure of four common accounting and stock- 
market-based indicators (that is, measures commonly regarded as objective and within 
the domain of financial performance) differed between groups of firms that had adopted 
two distinct strategic stances and over time. The factor loadings changed materially and 
in counterintuitive ways. Hence, not only does the appropriate measure of performance 
vary with the situation, but the measurement structure of indicators that capture that 
latent construct will change also. 

The relationship between measures and performance is also influenced by which mea- 
sures the firm uses internally and how these are embedded into incentive and control systems 
within the firm—for example, the firm’s own key performance indicators (KPIs). In other 
words, the internal measurement systems used will influence performance at the individual 
and organizational levels (Levenson, Van der Stede, & Cohen, 2006). This reveals an impor- 
tant gap in academic publications, as few, if any, empirical papers using firm performance as 
a dependent variable account for the internal incentive systems operating in the firms they 
are studying. Hence, we are making a quantum leap of faith in assuming that our measures 
relate to what the firm is seeking to achieve. Indeed, both Stern Stewart (Stern, Stewart, & 
Chew, 1995) and Kaplan and Norton (1996) emphasize that their measurement systems must 
play an active role in the management of the adopting firms in order to be effective. And to 
their credit, the adoption of residual income measures, such as economic value added (EVA) 
as promoted by Stern Stewart, has been shown to alter managerial decision making. Adopters 
of EVA show tighter investment control, more capital being returned to shareholders, and 
greater increases in residual income (Wallace, 1997). 


Implication 2: Measurement of performance must take into account heterogeneity of environments, 
strategies, and management practices. 
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Source of Dimensionality 3: The Measurement Timeframe and the Persistence 
of Performance 


A number of empirical studies have confirmed that performance itself does not persist indefi- 
nitely (Jacobson, 1988; Waring, 1996). This warns against the adoption of short- or medium-term 
measures, as these can be heavily biased by random fluctuations. In a famous example, a third of 
the “excellent” firms identified as superior long-term performers by Peters and Waterman (1982) 
in their study for Jn Search of Excellence had run into significant problems by the time the second 
edition of their very popular book went to press (““Who’s Excellent Now?,” 1984). As a group, 
the Peters and Waterman firms exhibited only average performance after publication (Aupperle, 
Acar, & Booth, 1986; Ramanujam & Venkatraman, 1988). By selecting firms based on a small 
number of criteria over a limited period of time, they failed to account for the natural variability 
and stickiness in performance. As a consequence, researchers and managers must be careful in 
the interpretation of performance differences, regardless of whether or not they sampled on their 
dependent variable. Empirically observed performance outcomes can be partially explained by 
random events and the nature of the models used to capture performance over time (Powell, 
2003). Even completely random Markov processes can cause heterogeneous performance out- 
comes in the absence of firm-specific heterogeneity (Denrell, 2004). This warns against the adop- 
tion of short- or medium-term measures, as these can be heavily biased by random fluctuations. 

Changes in performance have been shown to occur at different rates for industry, corpo- 
rate, and business unit factors (McGahan & Porter, 1999, 2003). Research that targets spe- 
cific factors needs to measure performance in a manner that matches that context. There is 
evidence that suggests that industry factors can sustain performance over the longer term 
(Waring, 1996). Consistent with this, McGahan and Porter (1999) found that industry-driven 
performance effects persisted at a rate of 76.6% to 81.8% but that firm-specific performance 
persisted at weaker rates of only 47.9% to 65.5%. As a consequence, studies should select a 
measurement timeframe in accordance with the phenomena they are examining. 

Many performance measures are themselves time dependent. Reputation effects link past 
performance to future performance, creating feedback within the dimensionality of perfor- 
mance itself (McGuire, Schneeweis, & Branch, 1990; Roberts & Dowling, 2002). Even 
when persistence suggests the presence of ongoing competitive advantage, that performance 
persistence is partially attributable to the time series characteristics that lead to stickiness in 
return measures (Jacobson, 1987, 1988). Auto-correlation can also be caused by unwanted 
biases. For instance, subjective measures are susceptible to bias arising from the availability 
of recent events (Tversky & Kahneman, 1973), whereas other relationships between mea- 
sures, including the halo effect (Rosenzweig, 2007), can lead to spurious relationships being 
inferred as substantive (Edwards & Bagozzi, 2000). Moreover, many objective measures, 
including accounting rates of return, have temporal properties that imply that the internal 
antecedents of performance in any one year may not relate directly to performance in that 
year even though they appear to be highly correlated (Jacobson, 1987). 

It is correctly argued by proponents of shareholder value that one needs a measure of 
performance that not only measures how a decision determines performance today but for- 
ever into the future, in essence, removing the timeframe from the equation (Benston, 1985; 
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Rappaport, 1978). In theory, with perfect information, total shareholder return is just such a 
measure. However, there are three issues that distort shareholder return as anything but an 
unbiased estimate of total value. First, information is imperfect and the financial market 
return can be, at best, a consensus forecast of value made by human beings. This is a fact as 
true for investors as it is for managers, who arguably possess better information. Second, the 
market return represents a consensus forecast at a point in time. This forecast is subject to 
updating as information becomes available. Third, the financial market itself is not free from 
structurally and psychologically induced biases. For example, the Internet boom-and-bust of 
the late 1990s represented a mixture of just such biases (Bond & Cummins, 2000). 


Implication 3: Measurement of performance requires an understanding of the time series properties 
relating organizational activity to performance. 


Review of the Measurement of Organizational Performance 


There are three common approaches to organizational performance measurement seen in 
the literature. The first is where a single measure is adopted based on the belief in the rela- 
tionship of that measure to performance (e.g., Hawawini et al., 2003; Hillman & Keim, 2001; 
Roberts & Dowling, 2002; Spanos, Zaralis, & Lioukas, 2004). Ideally, these beliefs are sup- 
ported by theory and evidence but, as noted above, are often merely assumed. The second 
approach is where the researcher uses several different measures to compare analyses with 
different dependent but identical independent variables (e.g., Baum & Wally, 2003; 
Contractor, Kundu, & Hsu, 2003; Miller, 2004; Peng, 2004). The third approach is where the 
researcher aggregates dependent variables, assuming convergent validity based on the cor- 
relation between the measures (e.g., Cho & Pucik, 2005; Goerzen & Beamish, 2003). This 
is most common with subjective measures of performance where the investigator is seeking 
something akin to trait-based psychometric validity (see Varadarajan & Ramanujam, 1990). 
However, it is not uncommon to see operational and financial market measures also being 
aggregated (see Rowe & Morrow, 1999). The justifiability of these approaches depends 
crucially on whether the specific measures used meet the theoretical, statistical, and psycho- 
metric assumptions made. We address the nature of these specific measures below by first 
examining the objective measures of performance—accounting and financial market mea- 
sures, plus firm survival—followed by subjective and quasi subjective measures—such as 
survey-based self-reports and Likert responses. These measures target the financial, product 
market, and shareholder outcomes that constitute performance. 


Objective Measures of Organizational Performance 


Accounting measures. Accounting measures are the most common and readily available 
means of measuring organizational performance (a summary of accounting measures is 
included in Table 2). The validity of their use is found in the extensive evidence showing that 
accounting and economic returns are related. For instance, Danielson and Press (2003) found 
that the correlation between accounting and economic rates of return was above 0.75, and 
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Jacobson (1987) found that despite a weak R? of 0.2, return on investment (ROT) was able to 
distinguish performance between firms and over time. Nevertheless, researchers must be 
cognizant of the fact that these measures can be distorted by accounting policies, human 
error, and deception. 

It is important to note that the rules that accounting systems are based on (such as GAAP 
standards) are not always consistent with the underlying theoretical logic of organizational 
performance. For instance, choices about depreciation schedules, inventory, and booking 
expenses can undermine the ability to accurately tap the time dimension. To rigorously apply 
accounting measures, one must understand the nature of the rules (equations) that define the 
measure of interest. However, researchers rarely have the inclination, time, or data to achieve 
this.* Another important limitation of accounting performance measures is that they empha- 
size historic activity over future performance (Keats, 1988). Due to their reliance on audit- 
able sources, accounting measures reflect what has happened and can be quite limited in 
anticipating and revealing expectations about future performance—a fact that possesses both 
a negative side, as in the case of Enron, and a positive side, as in the case of many early 
Internet companies. Hence, the apparent predictability and validity of accounting measures 
as signals of economic returns may have less to do with their validity and more to do with 
the stationary properties of the environment in which the measurement is taking place. The 
implication is that the more turbulent the environment is, the less clear the rules of perfor- 
mance are, and the more variable the regulatory and institutional environment in which firms 
are operating, the less valid and comparable are accounting measures as signals of economic 
returns. For instance, Jusoh and Parnell (2008) encountered difficulties applying Western 
accounting measures in the emerging Malaysian environment but found that measures of 
organizational effectiveness, such as customer and employee satisfaction, were more robust. 
A case study on measurement in Vietnam also found accounting measures to be a biased 
reflection of performance (Luu, Kim, Cao, & Park, 2008). 

Although normally considered a marketing measure, market share is another product of the 
accounting system. Market share is the ratio of the sales of the firm to the total sales in the relevant 
product market. As a consequence, it is composed of the accounting sales figures of all firms in 
an industry or product market. There is some disagreement over the classification of market share, 
some studies viewing it as an antecedent of financial performance (e.g., Capon et al., 1990) and 
others as a performance measure in its own right (see the appendix). 


Financial market measures. Within the strategy, economics, and finance literatures, finan- 
cial market-based measures—most dominantly shareholder return—are the preferred instru- 
ment for characterizing organizational performance (see Table 3 for a summary of financial 
market measures). The greatest strength of these measures is that they are forward looking, 
in theory representing the discounted present value of future cash flows (Fisher & McGowan, 
1983). They also incorporate intangible assets more effectively than accounting data (Lev, 
2001), something of clear relevance to those interested in resource-based and knowledge- 
based views of the firm. However, the connection between financial market measures, such 
as the stock price or excess stock returns, to the actual performance of the firm depends on 
how much of the rent generated from its activities flows to shareholders and the informa- 
tional efficiency of the financial market. The usual justification of these measures is that 
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Cash flow from operations 


Earnings before interest and taxes 
(EBIT) 


Earnings before interest, taxes, 
depreciation, and amortization 
(EBITDA) 


Market share 


Net operating profits (also known as 
earnings) 


Net operating profit less adjusted 
taxes (NOPLAT) (also known as 
net operating profit after tax 
[NOPAT]) 


Profit margin 
Return on assets (ROA) 


Return on book-valued assets 


Return on capital employed (ROCE) 
(often called return on capital 


[ROC]) 


Return on equity (ROE) 
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Table 2 
Accounting measures 


This accounting measure is used to examine whether cash flow 
differs significantly from earnings. It is defined as net operating 
profit plus noncash expenses minus noncash sales. 

This basic measure is often recorded on accounting statements as 
operating profit. This is the firm’s profit, which is defined as 
revenues minus costs of goods sold and administrative and 
selling costs associated with the firm’s operations. Interest and 
taxes the firm must pay are not deducted in the calculation of 
EBIT. 

Like EBIT, EBITDA is defined as the firm’s operating profit and does 
not make any allowance for interest and taxes that must be paid. It 
is also adjusted to remove the effects of noncash expenses such as 
depreciation and amortization (these are deducted from the cost 
component). 

Firm’s sales revenue in the product market divided by the total sales 
revenue available in that market. Although it can be defined as unit 
sales volume divided by the total volume of units sold in that 
product market, this is very rare in management research. 

This is equal to the firm’s revenue minus the cost of goods sold and 
selling, general and administrative expenses. Taxes and interest are 
removed to reach this net figure. 

This measure is similar to net operating profit but is adjusted to 
remove several accounting distortions. It provides a cash-based 
measure of net operating profit. Typically, this requires subtract- 
ing taxes after making adjustments for the effect of tax deferrals 
and taxes on interest and nonoperating income, adding back 
lease expenses and unwinding the amortization of goodwill. 
Some consultants make up to 160 adjustments. Interest costs are 
not subtracted; this is important as this measure is often used in 
EVA calculations that take interest costs into account by allow- 
ing for the cost of capital separately. 

This is the ratio of net operating profit to sales. 

This is a very popular accounting measure of performance. It is 
defined as the ratio of net operating profit to the firm’s start-of-year 
assets recorded on its balance sheet. 

This is return on assets but using the end-of-year book value of assets. 

ROCE is a measure of how well a firm is using capital to generate 
revenue. It is defined as EBIT divided by employed capital. 
Employed capital includes long-term debt and is equal to total 
assets less current liabilities and the value of intangible assets. 

A measure of how much the firm generates for its owners, ROE is 
equal to net profit divided by the book value of shareholder’s 
equity. Shareholder’s equity usually includes the value of reserves 
as these could be paid out to shareholders. 


(continued) 
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Table 2 (continued) 


Return on investment (ROD) 


Return on invested capital (ROIC) 


Return on net assets (RONA) 


Return on sales (ROS) 


Return on total assets 


Risk-adjusted return on capital 
(RAROC) (also known as return on 
risk-adjusted capital [RORAC]) 


This is a leading traditional measure. ROI is usually defined as the 
ratio of net operating profit to the net book value of assets. The net 
book value of assets is equal to the firm’s assets less the value of 
intangibles and total liabilities. In recent times, an increasing num- 
ber of publications use NOPLAT and other adjusted profit mea- 
sures as the numerator. 


This increasingly popular measure is defined as the ratio of NOPLAT 
to the firm’s invested capital. Invested capital is defined as total 
assets less excess cash and the value of noninterest-bearing current 
liabilities. These two adjustments to total assets are intended to 
remove the effects of assets that do not need to be supported by 
capital. 

This measure focuses on the assets the firm needs to generate its 
profit. It is calculated as the ratio of NOPLAT to net assets. Net 
assets is defined as fixed assets plus cash plus required working 
capital. This measure is closely related to EVA, as it is sometimes 
defined as EVA = (RONA — WACC) x Invested Capital. 

This is the ratio of net operating profit to the sales made by the firm 
in the period. 

This is the ratio of earnings available to common stockholders to the 
firm’s assets. This is virtually identical to return on assets, the use 
of total in the name signals that net profit (earnings) is adjusted to 
remove dividends for preference shares and other nonresidual 
claims (although most versions of ROA also do this anyway). 

This measure is used primarily by financial institutions. It is defined 
as the ratio of risk-adjusted earnings to economic capital 
employed. Here, the capital employed is evaluated relative to the 


market, credit, and operational risk involved. The results of a 
RAROC model are then generally used in calculating EVA or 
another measurement that accounts for risk. 


Sales This is the firm’s revenue from goods sold. 


Sales growth This is the change in sales over the period, expressed as the differ- 
ence between sales last period and those this period as a percent- 
age of the sales last period. 

Variance in accounting profitability A common accounting measure of risk is to use the variance in 
accounting profitability. This is often based on the volatility of one 


of the returns, such as ROA or ROI. 


firms are instruments of shareholders. But, as noted above, this assumption about the stake- 
holder dimension may not be as applicable in Continental Europe or Japan (Dore, 2000). Moreover, 
research on psychological and other influences suggests that market values do not simply reflect an 
efficient appraisal of future cash flows (Kahneman & Riepe, 1998; Malkiel, 2003). 

Empirical research in finance has shown that only a small proportion of share price move- 
ment is explained by systematic economic effects (Cutler, Poterba, & Summers, 1989; Roll, 
1988). Instead, share price movements are often attributable to financial market volatility 
(Shiller, 1989), momentum (Chan, Megadeath, & Lakonishok, 1996), and herding behavior 
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Table 3 
Financial market measures 


Beta coefficient The B-coefficient from the capital asset pricing model (CAPM). This is a 
measure of the level of systematic risk associated with the individual 
firm relative to the market portfolio. 


Earnings-per-share (EPS) This is a traditional measure of firm value. It is equal to net operating 
profit minus dividends paid to preference shares divided by the number 
of common stocks issued. 

Jensen’s alpha This is the a-coefficient from the CAPM. Jensen’s alpha is a measure of a 
firm’s excess return over that associated with the systematic risk of its 
operations. That is, this captures unique exceptional positive or 
negative performance. 


Market value (or market This is the total value of a firm’s common stock (which represents the 
capitalization) residual value of the firm’s resources). It is equal to the number of 
shares outstanding multiplied by their current stock price. 

Price-to-earnings ratio (P/E ratio) The P/E ratio is a common method of comparing firm valuations. It is 
defined as the ratio of the current stock price to the annual earnings per 
share the firm pays out. 

Return on market-valued assets Return on market-valued assets is the annual operating income divided by 
the beginning-of-year market value of equity plus the book value of 
long-term debt. 

Stock price This is the price of the firm’s listed common stock. 

Total shareholder return (TSR) Captures the gain (loss) made by shareholders during the period 
(generally each year). TSR is the sum of the change in stock price 
during the year plus any dividends paid out, expressed as a percentage 
of the opening value of the stock. 

Tracking stocks Securities issued that pay dividends based on the performance of some 
subset of the firm’s divisions (usually those from a single business 
unit). These provide a more pure reflection of the performance of a 
firm’s divisions (and are especially useful for multi-industry firms). 


(Graham, 1999; Grinblatt, Titman, & Wermers, 1995). The Internet boom-and-bust of the 
late 1990s represents a mixture of such biases (Bond & Cummins, 2000). One simply has to 
examine the situation associated with Worldcom and Enron to understand that such future 
forecasts are themselves conditional on available data. The role of this information is starkly 
confirmed by Chaney and Philipich (2002), who showed that the value of all Arthur Anderson 
clients declined when the veracity of Anderson’s audits was called into question and that this 
decline was even worse for those associated with the Houston office, which was responsible 
for auditing Enron. 

A major limitation of the use of financial market data in management research is that it 
evaluates the organization as a whole. For all but the few firms that have issued tracking 
stocks (Robinson, 2000), it is not possible to apportion market measures between activities 
(Jacobson, 1987). Therefore, although market value might be generally recognized as the 
most appropriate measure of overall organizational performance, it is less useful for research 
focusing on performance where the dimensionality is defined in terms of a product or a 
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strategic business unit (SBU). Attempts to disaggregate business unit performance are also 
problematic. Several researchers have pointed out that the logic of having a bundle of busi- 
ness units traded as a block implies that measuring financial performance and risk at the 
business unit level will be flawed due to the failure to account for the synergy and cannibal- 
ization associated with the interaction of the units (Bulow, Geanakoplos, & Klemperer, 
1985; Devinney & Stewart, 1988). Hence, even if one were measuring business unit level 
performance, it is unlikely that this alone will account for the performance of that unit within 
the strategic context of the corporation as a whole. 


Mixed accounting/financial market measures. An advantage of mixed accounting/financial 
market measures is that they are better able to balance risk (largely ignored by accounting mea- 
sures) against operational performance issues that are sometimes lost in market measures. 
Examples of mixed measures are given in Table 4. Tobin’s q is perhaps the earliest and most 
popular hybrid measure of firm performance. Tobin’s g is the ratio of the market value of firm 
assets to their replacement cost and is a theoretically based measure of economic return (Tobin, 
1969). One difficulty with the adoption of Tobin’s q is that the replacement value of the firm’s 
assets is almost always measured through its closely related proxy, the book value of assets 
(Varaiya, Kerin, & Weeks, 1987). This means that this is the historical rather than current replace- 
ment cost. Despite empirical similarity (a correlation between the two of more than 0.9 has been 
observed; Perfect & Wiles, 1994), the adoption of book value introduces the potential for a number 
of accounting distortions. This is seen in the failure of empirical Tobin’s g measures to include 
intangible assets in that replacement cost. Several authors have proposed formally measuring 
intangible assets and particularly the intangible intellectual capital resident within software, pat- 
ents, and employees (e.g., Edvinsson & Malone, 1997; Sveiby, 1997). However, despite these calls 
to improve the accounting of intangible assets, and their inclusion in firm financial statements, 
most current Financial Accounting Standards Board (FASB) accounting measures ignore them. 

This limitation has led to the development of a number of alternative mixed measures. 
Altman’s score (Altman, 1968) was one of the first such empirical measures and developed 
specifically to predict catastrophic financial events. The Z-score predicts a firm’s chance of 
bankruptcy, itself an extreme financial, product market, and shareholder outcome, using a 
combination of accounting and stock market measures including the ratios of working capi- 
tal to total assets, retained earnings to total assets, the market value of equity to the book 
value of liabilities, and sales to total assets (Altman, 1968). Stern Stewart’s trademarked 
EVA has become perhaps the most popular mixed measure (Stern et al., 1995). Early 
research suggested that EVA, which is based on the return over the cost of equity, was a bet- 
ter predictor than EPS, EPS growth (Milunovich & Tseui, 1996), ROA, ROE, and return on 
sales (ROS) (Lehn & Makhija, 1997). More recent studies have produced equivocal results, 
with accounting measures retaining incremental explanatory power (Chen & Dodd, 1997) 
and matching EVA (Biddle, Bowen, & Wallace, 1997; Chen & Dodd, 2001). 


Survival. Survival is a common dependent variable in management research, particularly 


in organizational sociology and entrepreneurship where increasing attention is given to 
ecological explanations of firm performance (Hannan & Freeman, 1977). Many firms fail, 
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Balanced scorecard 


Cash flow per share 


Cash flow return on 
investment (CFROI) 


Cash value added 
(CVA) 


Discounted cash flows 
(DCF) 


Economic value added 
(EVA) (the generic 
name for this is 
economic profit) 

Free cash flows 


Internal rate of return 
(IRR) 

Market-to-book value 

Market value added 
(MVA) 

Net present value 
(NPV) 

Shareholder value 
analysis (SVA) 


Tobin’s g 


Total business return 
(TBR) 
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Table 4 
Mixed accounting/financial market measures 


The balanced scorecard is a framework that draws together multiple measures aimed at 
financial performance, internal business processes, customer perspectives, and innovation 
and learning. The aim is to enable firms to build a comprehensive performance mea- 
surement system. There is a similar concept in French accounting called the Tableau 


de Bord. 
This is defined as the cash flow from operations minus preferred stock dividends divided by 


the number of common shares outstanding. This is a measure of the cash flow associated 
with each share. 

This is an inflation-adjusted approximation of the internal rate of return earned by a 
company over all its operating assets. Normally, this is done by discounting cash 
flow projections that are calculated based on ROI. 

The CVA is the difference between a firm’s operating cash flow (OCF) and the operating 
cash flow demand (OCFD) that it must pay shareholders. The OCF is the firm’s 
EBITDA (which includes only cash effects) less any working capital changes and 
nonstrategic investments made during the period. The OCFD is defined as the investors’ 
opportunity cost of the investment in cash terms. This provides a dollar value estimate of 
the net performance of the firm. 

This is the present value of future cash flows. These are discounted for the time-value 
of money, usually at the firm’s WACC. DCF models then compare future free cash 
flows to the debt and other cash investments required to support them. 

This highly popular measure adjusts accounting earnings for the cost of capital. It is 
normally defined as NOPLAT — (WACC x Invested Capital). The WACC is usually 
calculated through a rule of thumb, such as it being the risk free rate plus 6% multi- 
plied by the firm’s beta (Chen & Dodd, 1997). 

Free cash flows are the cash remaining for shareholders after all other claimants are 
paid. In each period, they are defined as the firm’s net operating profit less taxes, 
operating investment required to sustain the firm, and any additional working capital 
requirements. These are a key component of DCF calculations, which discount them 
back to present values. 

The IRR is the discount rate that results in the NPV of a series of future cash flows 
flowing from an investment being zero. 

The ratio of an organization’s market value to the book value of assets. 

Defined as the market value of the firm less the book value of debt and equity. Therefore, it 
represents the excess value of the firm over the capital used to support it. 

NPV is the difference between the present value (PV) of discounted future cash flows 
and the investment required to earn them. 

This measurement approach assesses shareholder value as the residual value of the firm. 
Generally, it is defined as shareholder value equal to corporate value minus debt. 
Corporate value is calculated by discounting future earnings at the cost of capital (or 
weighted average of the cost of debt and equity) and adding a residual value to capture 
the present value of cash flows outside the discounted period plus the current value 
of any liquid assets (such as cash or marketable securities) (Rappaport, 1986). 

This measure is defined as the ratio of the market value of the firm’s assets to their 
replacement cost. The market-to-book value is often used as a proxy as the replacement 
cost of the firm’s assets is difficult to estimate. 

TBR is closely associated with CFROI. It adopts an approach similar to TSR but is 
based on cash flows. TBR is defined as the terminal value of business less cash 
investments made during the period plus cash flow received during the period 
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Table 4 (continued) 


Warranted equity WEY is a modification of EVA used by financial institutions. Here, the cost of capital is 
value (WEV) calculated based on capital-at-risk (due to the prudential requirements applying to 
banks). 
Weighted average This is a measure of the cost the firm must pay for the capital it employs. It is the 
cost of weighted average of the cost of debt and the cost of equity. The cost of debt is usu- 
capital (WACC) ally adjusted to reflect the tax deductibility of interest expenses. 
Z-score Developed by Altman (1968), the Z-score provides an indication of the likelihood of a 


firm going bankrupt. It is based on a linear model of five common financial ratios: 
working capital/total assets, retained earnings/total assets, EBIT/total assets, market 
value of equity/book value of total liabilities, and sales/total assets. 


making survival pertinent to researchers and managers alike (Dunne, Roberts, & Samuelson, 
1988). Survival and financial performance outcomes are closely related, with an examination 
of financial market performance finding that delisting firms underperformed the median 
stock market return of firms on the NYSE and AMEX by 48% from 10 years to | year before 
delisting (Baker & Kennedy, 2002). This is consistent with theory that suggests that adapta- 
tion and selection operate simultaneously, with the relative importance of each depending on 
the degree of organizational inertia, and the nature of the research in terms of the period of 
interest (Hannan & Freeman, 1977). Survival is generally measured by a categorical variable 
capturing the ongoing presence of the firm. The positive aspect of this is that it is easier to obtain 
historical data on the existence of a SBU than its disaggregated financial performance. However, 
not all exits are equal or even directionally equivalent. Mergers or being acquired suggest higher 
performance than bankruptcy. Indeed, being acquired generally results in shareholders of the 
target receiving a premium (Jarrell & Poulsen, 1987). And in the case of many smaller firms, 
being acquired is potentially a signal of success and strength rather than weakness. Furthermore, 
the infrequency of liquidations suggests that survival is unlikely to provide sufficient variance to 
discriminate between high and low performing firms in studies that focus on short-term phenom- 
ena. Indeed, Hannan and Freeman (1977) note that of the 1955 Fortune 500, only 1 (0.2%) had 
liquidated by 1975 whereas 122 (24.4%) had merged. Of the remainder, 268 (53.6%) remained 
and 109 (21.8%) had slipped off the list due to relatively low growth. This also illustrates that 
categorical survival measures provide only limited insight into the relative performance of the 
heterogeneous firms that are placed into each category, severely curtailing the dimensionality of 
performance. At the extreme, survival measures would provide no information on the relative 
performance of the 13 firms that existed during the American Civil War that still survived in 1976 
(Hannan & Freeman, 1977). 


Subjective Measures of Organizational Performance 
Subjective measures ask supposedly well-informed respondents (key informants) about 


organizational performance. This allows them to be strongly tailored to the dimensionality 
of the context of interest. Broadly speaking, subjective measures can be divided into two 


Downloaded from jom.sagepub.com at UNIV OF WISCONSIN on July 5, 2012 


Richard et al. / Organizational Performance 735 


groups: (a) those that are fully subjective and (b) those that replicate objective measures, 
which we term quasi-objective. Subjective measures have traditionally been viewed with a 
great degree of skepticism based on the fact that the more objective the focal construct is, 
the less scope there is for bias. Subjectivity introduces increased error by allowing the imper- 
fections of human cognition to play a greater role (Gilovich, Griffin, & Kahneman, 2002; 
Kahneman & Tversky, 2000). Empirically, this has been supported by studies of the Fortune 
reputation index, a broad multidimensional subjective data index, showing that evaluator 
assessments of the more specific dimensions were better at explaining objective performance 
than those on more abstract dimensions (McGuire et al., 1990). 

Yet, despite these issues, the broadening of the theoretical and normative aspect of firm 
performance has led to increased interest being focused on the more subjective dimensions 
of performance in recent years. This increased attention has been in line with the trend of 
assessing performance against a triple bottom line of economic, social, and environmental 
performance and against balanced scorecards that add customer, internal process, and inno- 
vation measures to the measurement of financial performance (Kaplan & Norton, 1996). To 
illustrate the trend and its implication, we focus on two of these areas: corporate social per- 
formance (CSP) and reputation. 

Assessments of CSP have become increasingly common and found their way into analysts’ 
evaluations of firms’ strategic and financial positions. CSP is now considered alongside financial 
performance in many practical applications and is increasingly finding its way into mainstream 
regulatory requirements. This has further blurred the application of the distinction between orga- 
nizational performance and organizational effectiveness. More than 4,700 firms from 130 coun- 
tries, including 108 of the 2007 Financial Times Global 500, have committed to providing annual 
assessments of CSP under the United Nations’ Global Compact framework (United Nations, 
2007). Researchers, too, have responded to this trend. By 2003, there were already 52 published 
studies that estimated the correlation between CSP and financial performance (see Orlitzky, 
Schmidt, & Rynes, 2003). Yet, even here, the results are not clear and depend critically on context 
and measurement. A meta-analytic review of these studies showed a true correlation between 
CSP and more traditional financial performance measures of 0.36 (Orlitzky et al., 2003). 
However, Margolis, Elfenbein, and Walsh (2007) find that what effects exist are small, dependent 
on the measures used and categories studied, and bidirectional—logically and empirically, it is as 
likely that financial performance causes CSP as it is that CSP drives financial performance. The 
most common measure of CSP is the index produced by Kinder, Lydenberg, and Domini (KLD). 
The KLD index assesses CSP based on social “‘strengths” and “concerns,” including the environ- 
ment, diversity, employee relations, human rights, product features, and corporate governance. A 
recent analysis of the KLD index found that it was able to explain 69% of the variance in ROA 
and 39% of the variance in EPS for a sample of 734 corporations in the period from 1997 to 2002 
(Van der Laan et al., 2008). 

The Fortune reputation surveys also subjectively assess performance (see Chakravarthy, 1986; 
Fombrun & Shanley, 1990). The Fortune reputation measures are based on subjective evaluations 
by managers and stakeholders of eight dimensions: financial soundness, long-term investment 
value, use of corporate assets, quality of management, quality of products (or services), innova- 
tion, ability to attract and retain talented people, and social responsibility. The eight dimensions 
have been found to have a reliability of a = 0.97 (Fombrun & Shanley, 1990), are strongly 


Downloaded from jom.sagepub.com at UNIV OF WISCONSIN on July 5, 2012 


736 Journal of Management / June 2009 


correlated with past financial performance (Rowe, Cannella, Harris, & Francolini, 2003), and 
have a modest correlation with future performance (McGuire et al., 1990). The reputation mea- 
sures also contain explanatory power distinct from that of financial performance. Roberts and 
Dowling (2002) found that reputation included a clear “financial reputation” component that was 
related to earlier financial performance but that there was also a material “residual reputation.” 
Residual reputation predicted future performance even though there was evidence that residual 
reputations were supported by factors (e.g., brand advertising) that actually weaken short-term 
profitability (Roberts & Dowling, 2002). 


Fully subjective measures. Fully subjective self-report measures allow researchers to 
address latent performance constructs directly. Instead of asking for opinions on some objec- 
tive measure, as in the case of quasi-objective measures, fully subjective self-report ques- 
tions assess the underlying performance construct itself. An important aspect of this direct 
approach is that because these measures are not anchored to any definite object, they are 
inherently relative (March & Sutton, 1997). For instance, the respondent may be asked to 
compare the performance of the company to that of a rival, to management expectations, or 
to some other benchmark. For instance, Gupta and Govindarajan’s (1986) performance mea- 
sure had respondents rate “performance relative to superiors’ expectations on a 5-point scale 
ranging from not at all satisfactory to outstanding” (p. 713). The lack of a fixed reference 
point also provides subjective methods with much flexibility. Researchers are able to define 
the content and anchoring of questions to target the dimensions of performance directly, 
either individually or in aggregate. However, this relativity can make fully subjective mea- 
sures unreliable in the face of the highly variable aspirations of respondents. 

Fully subjective measures face challenges from psychological biases, which can lead to 
highly skewed and unrepresentative outcomes. Cognitive biases can influence subjective 
measures, particularly self-report measures from individuals who are part of the focal orga- 
nization. For example, evidence suggests that respondents tend to view themselves positively 
(Taylor & Brown, 1988), construe external criteria in their favor (Stajkovic & Sommer, 
2000), and rely on causal ambiguity to take credit for positive outcomes (Campbell & 
Sedikides, 1999). The halo effect can also materially affect perceptions of performance 
(Rosenzweig, 2007). Sound research design can reduce these biases. For instance, the qual- 
ity of self-report is increased the closer collection is to the event (Arnold, Collier, Leech, & 
Sutton, 2000; Mezias & Starbuck, 2003) and by selecting well-informed respondents 
(Winter, 2003). McGuire et al.’s (1990) study of the Fortune reputation index found that 
items with more specific definitions contained less measurement error. 

Yet, despite the obvious issues, the correlation between subjective and objective measures 
has been shown to be between 0.4 and 0.6 (Wall et al., 2004), with correlations as high as 
0.81 achieved by using more specific subjective constructs (Guthrie, 2001). Venkatraman 
and Ramanujam (1987) compared objective COMPUSTAT data with the subjective percep- 
tions of senior managers for sales growth, net income growth, and ROI and found correla- 
tions of 0.44, 0.42, and 0.51 and similar levels of trait, method, and random error variance 
that constituted roughly 50%, 30%, and 20% of variance, respectively. In addition to evi- 
dence supporting moderate convergent validity, Wall et al. (2004) found a high level of dis- 
criminant validity with the correlation between subjective and objective profit and 
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productivity measures found to be significantly higher than the correlation between these 
measures and their values for previous years. This result is all the more impressive given the 
well-accepted autocorrelation of performance due to reputation. The measures were also 
found to display strong construct validity, in that they related to measures of other related 
constructs in a consistent way (Wall et al., 2004). These empirical findings suggest that 
researchers should not view the choice of subjective measures as a second-best alternative 
but, instead, should weigh the tradeoffs between subjective and objective measures against 
the research context to determine which is more favorable under the circumstances. 


Quasi-objective measures. Quasi-objective measures elicit specific objective performance 
information through self-report techniques, for instance, by asking a salesperson the level of 
sales or the CEO to estimate the market value of the firm. Venkatraman and Ramanujam 
(1986, 1987) distinguished these measures from objective values collected from secondary 
sources. However, this is a harsh distinction, as the vulnerabilities of accounting systems 
suggest that similar imperfections can attend both sources. Influential research has often 
treated these as equivalent to the fully objective measures they reflect. 

Dess and Robinson (1984) compared quasi-objective and fully subjective measures of 
ROA and sales growth in privately held firms. This review of performance measures indi- 
cates that although quasi-objective and fully subjective measures share much variance, some 
variation remains between them. This is consistent with the dimensionality of performance 
itself. For instance, it would be surprising if a single organizational action affected account- 
ing, financial market, survival, and subjective measures of performance equally. Indeed, a 
single action like an asset sale might improve accounting performance while hurting finan- 
cial market assessments. This suggests the need to select a measure of performance that is 
closely related to the research question under investigation. It is unfortunate that what this 
implies (particularly given the bias against publishing insignificant results) is a tendency to 
select performance measures for which significant effects are either found or likely to be 
found rather than, more correctly, relating a series of theoretical antecedents to an overarch- 
ing construct of performance for which some dimensions are relevant and others are not. 
Researchers need to maintain a broad measure of performance—one that accounts for its 
multidimensionality but also one that allows for the variation between measures. 


Implication 4: Performance measures should not be made specific to the research question but be 
sufficiently robust to cover the domain of organizational performance. 


Performance measurement is further complicated by the availability of the data needed to 
construct the measures and the need to carefully specify how the data and measures relate to 
other constructs in a model and to one another. The practical issues of collecting the data 
appropriate for specific measures can be material. For many areas of research, no objective 
financial market or accounting measures exist. For example, when measuring the perfor- 
mance of the subsidiaries of multinational enterprises (MNE), there may be no objective 
accounting or financial market data available publicly and, instead, one must rely on subjec- 
tive managerial estimates. However, even if we put this practical concern aside, the main 
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problem is the nature of the relationships between the various measures themselves. The 
specification of models requires consideration of a number of relationships, as financial, 
product market, and shareholder performance outcomes are associated with many anteced- 
ents as well as being related to past performance. 

There is little agreement between researchers on either an accepted definition of perfor- 
mance or the appropriate structural form of the relationships between measures. This has 
produced wide variation in the specification of models used. The divergence in definitions 
of performance is evident in 14 papers in the appendix that adopt effectiveness measures that 
they describe as measuring performance. Moreover, establishing a model requires that 
researchers specify relationships between accounting, financial market, and other measures 
of performance, between objective and subjective measures of performance, and between 
performance measures and measures of organizational effectiveness. Research has identified 
a number of structural relationships between the antecedents of performance and perfor- 
mance outcomes that complicate this challenge. For instance, Bryant, Jones, and Widener 
(2004) found that measures within a balanced scorecard were strongly interrelated, with 
learning having direct and indirect effects on process and customer outcomes, which in turn 
predicted financial performance. 

The measures selected and measurement approach used will also affect research findings. 
Measurement model selection can materially affect the fit of research models (Ailawadi, Dant, & 
Grewal, 2004). The level of analysis at which measures are collected also matters, with a com- 
parison of coefficients of 10 of the most common predictors of financial performance across 163 
studies producing three cases where coefficients with different signs were produced at the busi- 
ness, firm, and industry levels (Capon et al., 1990). Capon et al. report that model specification 
and estimation method explain significant amounts of variance in reported relationships between 
accounting and financial market measures and measures of organizational effectiveness and that 
these effects can be larger than the variance explained by the selection of alternate financial per- 
formance measures. Even the specification of performance measures, either as direct reflective 
indicators of the single-factor performance construct or as second-order indicators of outcomes 
that formatively define a latent construct, is nontrivial (see Rowe & Morrow, 1999). Hence, it is 
not clear that simply having more and varied measures increases the clarity of measured perfor- 
mance unless they are also modeled appropriately. 


Implication 5: Measurement of performance requires an understanding of the relationship between 
measures. 


The synthesis between the review of dimensionality and the review of measures reveals 
that there is no singular measure of performance that is without limitations. Subjective mea- 
sures are easily rejected by senior managers and academics as biased, yet the empirical 
evidence shows that they are not inherently erroneous and that similar criticism could be 
raised with respect to the limitations of objective measures. Devinney, Yip, and Johnson 
(2009), using data from more than 3,000 global firms in 38 industries, show that the pattern 
of correlations between different measures is such that a minimum of three dimensions is 
necessary just to characterize the basic aspects of performance. 
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Measurement Implications and Discussion 


Our discussion highlights the implications of using organizational performance in manage- 
ment research. The most important challenge is to isolate how methodological approaches can 
address the multidimensional nature of performance. There are a number of alternatives available 
in attempting to resolve the performance measurement dilemma that do not require coming up 
with completely new measures (in other words, that rely on some combination of accounting, 
financial market, and objective and subjective measures of some type). The discussion below 
examines how specific measurement approaches can address the five implications identified. 
Table 5 summarizes the measurement methods that can address these implications. 

What follows is less of a summary of “what works” and more of a call for specific 
research on how we currently do measure organizational performance and how we might do 
so in the future. What is clear from our earlier discussion and what will follow is that there 
are many holes in a critical component of what we as management researchers do and that 
the only way to begin plugging those holes is to come to grips with how well we have mea- 
sured organizational performance and how that can be improved. 


Triangulation and Longitudinal Analysis: A Reassessment of an Old Story 


There has been a long tradition that has called for both triangulation from multiple mea- 
sures and the application of longitudinal analysis, both of which have had only a marginal 
effect on the management research literature. The review of articles in the appendix found 
that single measures, or single measures recorded over time, were used in 105 of the 213 
papers. That slightly less than half of the empirical papers in the leading journals in manage- 
ment rely on the dimensionality of a single measure is a serious concern and justifies the 
need for further debate. Even where multiple measures were used, in only 33% of instances 
these were collected from multiple measurement sources (such as accounting and financial 
market data or from executive surveys and secondary sources) that allowed measurement 
error to be isolated. Only 9 studies used the broadest triangulation by adopting both objective 
and subjective measures. Longitudinal data were more common, with 48% of studies col- 
lecting data over time. However, these data were not used in the most effective manner, with 
many studies simply averaging performance figures or placing them in cross-sectional mod- 
els. Sixty-five studies (including several population ecology articles) adopted models that 
allowed for time series properties, but only 16 of these had multiple measures. This is dis- 
heartening as these were relatively well-publicized conclusions of earlier reviews of perfor- 
mance measurement (see Chakravarthy, 1986; Venkatraman & Ramanujam, 1986). 


Triangulation. Triangulation with multiple measures offers the advantage of simultane- 
ously reducing measurement error and improving construct validity (Campbell & Fiske, 
1959; Venkatraman & Ramanujam, 1987). This is, of course, conditional on the fact that the 
multiple measures are tapping the same theoretical domain. This allows triangulation to 
address Implications | and 2 by allowing for the concerns of multiple stakeholders and 
across heterogeneous contexts. Triangulation also maintains the robustness required in 
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Table 5 


Measurement methods and methodological validation needed 


Rationale for Methodology 
(review implications) 


Methodological 
Solution(s) 


Present Knowledge and Validating Research Needed 


Implication 1: Measuring triangulation of 
performance requires 
weighing the relevance 
of performance to focal 
stakeholders. 


multiple measures 


Implication 2: 
Measurement of 
performance must take 
into account 
heterogeneity of 
environments, strategies, 
and management 
practices. 


triangulation of 
multiple measures 
and data 
envelopment 
analysis (DEA) 


Implication 3: 
Measurement of 
performance requires an 
understanding of the 
time series properties 
relating organizational 
activity to performance. 


longitudinal data 


Implication 4: 
Performance measures 
should be sufficiently 
robust to cover the 
domain of organizational 
performance. 


triangulation of 
multiple measures 


Implication 5: 
Measurement of 
performance requires an 
understanding of the 
relationship between 
measures. 


nonparametric 
approaches: DEA 


Present: Financial market and accounting measures are 
favored due to their importance to stockholders and 
managers (Dore, 2000; Jensen & Meckling, 1976). 

Needed: (a) Research that shows which measures best 
address other stakeholders. (b) Research examining the 
importance of different stakeholders across 
heterogeneous environments. 

Present: Researchers use discipline-specific measures with 
their relationship to wider organizational performance 
being assumed. 

Needed: (a) Research that systematically links these 
specific measures to the broader performance construct 
while retaining the reality that there are multiple paths 
to performance. (b) Research on how the internal 
incentive and measurement systems (such as balanced 
scorecards) operating in organizations influence the 
dimensionality of performance. 

Present: Performance measures should span sufficient time 
to overcome random noise but also be split into fine 
enough periods to observe variation in performance. 
There is growing coalescence around the need for a 
10-year timeframe to overcome random variation 
(see Kirby, 2005). 

Needed: Research examining the time series properties 
of phenomena of interest. This will provide a more 
systematic understanding of when finer time periods 
and lagged measures are needed. 

Present: Studies use a narrow range of measures and favor 
objective measures. The incremental benefit of using 
multiple objective measures has been shown (Chen & 
Dodd, 1997). 

Needed: Research on the additional explanatory power 
provided by using subjective and objective measures 
together. 

Present: Studies using multiple measures create an index or 
use factor analytic techniques. These impose an artificial 
dimensional structure that is unlikely to capture the 
dimensionality of performance. 

Needed: Research applying nonparametric approaches, 
such as DEA, and comparisons of these against factor- 
analytic, structural equations and regression-based 
approaches. 
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Implication 4. It is surprising that triangulation is rare in management research (Schriesheim, 
Powers, Scandura, Gardiner, & Lankau, 1993) and, when used, invariably relies on a reflec- 
tive formulation where construct validity is akin to high reliability as measured by Cronbach’s 
alpha (e.g., Nadkarni & Narayanan, 2007). This, less than subtly, flies in the face of the 
multidimensionality of performance because it is the independence of the dimensions that 
makes triangulation relevant. Hence, appropriate triangulation requires not just aggregation 
but also a theoretical understanding of how to triangulate the components of performance— 
something that has yet to be effectively addressed by any area of research. 

The importance of this last point can be seen in the data presented in Tables 6 and 7. These 
data show the factors extracted from 10 performance measures spanning accounting, stock 
market, and mixed accounting/stock market approaches that covered 3,000 firms in 38 industries 
across 125 countries for 20 years (Devinney et al., 2009). The results are compelling and clear 
and illustrate the major problem with triangulated approaches based on factor analysis: They do 
not work effectively when the structure of the construct is multidimensional, possesses nonre- 
cursive properties, and has complex interactions among items. It is unfortunate that these are 
exactly the properties possessed by organizational performance measures, which, as in Table 7, 
generate several factors each with a narrow coverage. In addressing Implications 1, 2, and 4, 
factor analysis violates Implication 5 by imposing an artificial structure. What is also important 
to understand is that the measures used in Tables 6 and 7 are more likely to be consistent and 
clear, unlike subjective measures where additional biases are introduced. The inclusion of sub- 
jective measures is likely to exacerbate problems with factor analytic approaches. 


Longitudinal analysis. Extending one’s measures to include longitudinal data (i.e., mea- 
sures repeated over time) can be used to tap the time dependent nature of performance as 
well as serving to remove error. By selecting sufficiently short individual periods to observe 
changes in performance, while tracking this over a longer period of time, the conflicting 
needs of Implication 3 can be achieved. Using longitudinal data in this way abstracts away 
from the absolute level of performance and instead focuses attention on changes in perfor- 
mance, effectively controlling for time invariant error. Ailawadi et al. (2004) found that 
adopting this approach reduces the influence of common method errors, such as format 
effects. It also corrects for contextual firm-specific fixed effects (Boulding, 1990). Research 
suggests that a 10-year period is needed to validly identify high performance, with 15 years 
being preferable (Collins & Porras, 1994; Foster & Kaplan, 2001; Kirby, 2005). Collecting 
data over multiple time periods also allows the use of pooled data methods that can test for 
and account for shifting model parameters as well as those associated with the time series 
itself (Bowen & Wiersema, 1999). This insight into parameters provides additional informa- 
tion that assists in addressing Implication 5 (see Table 5). 


Addressing the Dimensionality of Organizational 
Performance Through Nonparametric Means 


To date, nearly all performance measurement approaches are parametric and invariably 
aim at reducing the dimensionality of a set of items into a more easily interpretable, reflective 
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Table 6 
Factor analysis of performance measures (structure matrix) 

Performance Measure 1 2 3 4 

Profit margin 0.408 —0.001 0.199 0.922 
Return on shareholder funds 0.827 -0.018 0.025 0.213 
Return on total assets 0.866 0.005 0.055 0.511 
Return on capital employed 0.900 —0.005 0.039 0.390 
Cash flow to operating revenue 0.341 —0.002 0.185 0.933 
Return on sales 0.083 0.027 0.896 0.275 
Change in market capitalization 0.058 0.939 0.046 0.064 
Total shareholder return 0.051 0.935 0.038 0.058 
Sales growth 0.013 0.030 0.917 0.016 
Tobin’s g 0.409 0.238 0.100 0.263 
Eigenvalue 3.442 1.820 1.702 1.066 


Source: Devinney, Yip, and Johnson (2009). 
Note: Oblimin rotation, loadings above 0.4 in bold. 


Table 7 
Component correlation matrix (oblimin rotation) 
Component 1 2 3 4 
1 1.000 
2 0.097* 1.000 
3 0.081* 0.069* 1.000 
4 ~0.207#* 0.110** ~0.088* 1.000 


Source: Devinney, Yip, and Johnson (2009). 
*p < 05. **p < 01. 


construct. As noted, this undermines the actual dimensionality by imposing an artificial 
measurement structure and weakens triangulation by leading to the generation of several 
factors each with a narrow coverage. To preserve underlying dimensionality, it is more trac- 
table to operate at a disaggregated level and not impose a relationship between measures 
(unless, of course, one has a sound theoretical model informing you as to what the specific 
measure means, as in the case of Tobin’s g and economic return). 

From a theoretical perspective, the parametric approaches used in the literature are based 
on central tendencies (or, in the case of survival, on specific distributional specifications). In 
other words, for most studies, high performance is based on a mathematical averaging of a 
set of underlying measures against an assumed distribution of performance. However, in 
contrast to the usual comparisons against “average” performance, it could be argued quite 
compellingly that resource-based theories of strategic advantage are actually based on the 
notion of dominance (e.g., Barney, 1991; Devinney, Midgley, & Venaik, 2001). According to 
Devinney et al. (2009), 
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A firm “strong-form dominates” (or outperforms) a set of other firms when it does better than 
all those firms on all of the components of the set of performance indicators. . . . Any firm 
“weak-form dominates” a set of other firms when it does better than all those firms on at least 
one component of a set of performance indicators. . .. Any firm outperforms (in terms of linear 
dominance) another set of firms when there is no linear combination of those other firms that 
can do better across all dimensions in the set of performance indicators. 


The idea of dominance leads one to the formulation of a frontier of performance that is 
theoretically structured to match with nonparametric linear programming approaches such 
as Data Envelopment Analysis (DEA; Charnes, Cooper, & Rhodes, 1978). By allowing the 
relationship between the measures to be based on their observed nature, DEA avoids making 
the assumptions that create the problems associated with Implication 5. This also allows the 
model to retain natural heterogeneity, addressing Implication 2. As DEA is a multiple input— 
multiple output method, it theoretically can account for multidimensionality (Yip, Devinney, 
& Johnson, 2008). 

Research has shown that such nonparametric techniques can not only represent the mul- 
tidimensional nature of performance in a superior fashion to the traditional parametric alter- 
natives (Devinney et al., 2009) but also provide many distinctive practical advantages 
relative to regression-based approaches (Thanassoulis, 1993). Feroz, Kim, and Raab (2003) 
showed that DEA approaches can be used to simultaneously extract information from 
accounting ratios without making comparisons “one ratio at a time” and avoiding ad hoc 
aggregation of different performance measures. The method can be further strengthened by 
the inclusion of classification information. Cook and Bala (2007) discuss a two-stage DEA 
process that is able to assign cases to separate groups. This can allow for discontinuous 
heterogeneity. For instance, this would allow discontinuities in environmental conditions 
(e.g., between institutional environments) or strategic groups (i.e., between groups of firms 
with relatively homogeneous assets and strategies) to be specified in the model. 


A Call for More Concern About Performance Measurement 


The above discussion and the summary in Table 5 emphasize the need for a greater ques- 
tioning into the nature of performance measurement. There are five findings that follow that 
we put out as a call for more research questioning how we address our ultimate dependent 
variable. 

First, we need theoretically stronger discipline-specific measures that address the rela- 
tionship between organizational practices and organizational performance. To date, we 
assume that our measures of performance matter and are appropriate. Unlike our indepen- 
dent variables (on which we form hypotheses and that are the focus of our tests and theories), 
we do not justify why we are using particular dependent variables and why they are the most 
appropriate dependent variables rather than the most conveniently available in COMPUSTAT. 
Second, we require research that systematically links the performance measures used to a 
broader construct of sustainable organizational performance in line with extant theories of 
strategic advantage. These theories argue that (a) sustainable performance is possible and 
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(b) there are multiple paths to such performance. Third, we require more effective research 
examining the cross-sectional and time-series properties of performance and the generation 
of established recognized benchmarks. Unlike finance, we have no established methods or 
measures that are replicated and benchmarked and, hence, no basis on which to compare 
research findings. Fourth, we require research that addresses the relationship between differ- 
ent levels of analysis and different methods, particularly subjective versus objective mea- 
sures and business unit, subsidiary, and corporate level performance. Fifth, we require 
research into the application of new and/or different mathematically based approaches to 
measurement. To date, management research on performance has been locked into three 
methodological paradigms: sociology (survival analysis), psychology (mainly applying psy- 
chometric techniques), and economics (econometrics). These are restrictive in the sense that 
they derive from disciplinary-based assumptions about individual and firm behavior that is 
accepted rather than tested. 


Conclusion 


Organizational performance is important to scholars across the entire domain of manage- 
ment research. Strategy and accounting scholars seek to influence and measure organiza- 
tional performance. Scholars in marketing, operations, and human resource management 
seek to understand and improve performance. In doing so, they each adopt discipline- 
specific measures such as customer satisfaction, productivity, and employee satisfaction (see 
Chenhall & Langfield-Smith, 2007). Understanding how such discipline-specific measures 
load onto the dimensions of organizational performance and the interrelationships between 
specialist measures is essential to understanding the relationships between multiple organi- 
zational actions. This makes the valid measurement of an overarching performance construct 
important in allowing researchers in fields such as strategy, marketing, and human resource 
management to appreciate the linkages between the phenomena they study. 

This review identifies that any study that claims to address organizational performance 
must include strong theory that addresses two key issues: (a) the dimensionality of perfor- 
mance (1.e., establishing which measures are appropriate to the research context) and (b) the 
selection and combination of performance measures (i.e., establishing which measures can 
be usefully combined and the method of doing so). The first inquiry is about the nature of 
performance and the second is about the nature of measurement. To be strong, the theoretical 
rationale for an approach to performance measurement must be both comprehensive in its 
assessment and rigorous in validation. To be comprehensive, the theoretical rationale for the 
performance measures used in a research study must ensure that each of the five implications 
drawn from this review is addressed. Validation in turn requires empirical evidence proving 
that methods used to create the measures and the measures themselves possess stable and 
expected properties appropriate to the theoretical structure. 

Validating strong theory on the measurement of organizational performance is an impor- 
tant agenda for further research at both the theoretical and empirical levels. To a great degree, 
we have assumed that if we just had better data, we would be able to understand performance 
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better. This, we argue, is a fallacy. Even with better data, the multidimensionality of perfor- 
mance will imply that we need theory to help us understand how the dimensions go together. 
Similarly, as the link between the context and the measures is critical, we need to understand 
how the specific performance measures we use are influenced by the complex combination 
of context and actions over time. As it is unlikely that objective measures alone will capture 
this, we require research on those combinations of subjective and objective measures that 
best capture performance, over what time period fluctuations in performance appear, and 
most important, a broader exploration of the paths that link heterogeneous environments, and 
firm characteristics, practices, and strategies, to overall organizational performance. 

In the end, this review aims to provide not just an overview of what we know about 
performance as a dependent variable but what we don’t know. What we don’t know and how 
we address our ignorance will determine whether or not management research succeeds. For 
without the ability to link managerial prescriptions based on theory to practical and observ- 
able and justifiable performance outcomes, management research will be little more than 
informed speculation or, as The Economist (“Business Schools and Research,” 2007) points 
out, “practically irrelevant.” 
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Notes 


1. This frequency of performance measurement has continued to be strong with 24% of papers in the same three 
journals adopting performance as a dependent variable during 2005 to 2007 (see Table 1). 

2. We place innovation as outside of firm performance. Although it is often an important antecedent of perfor- 
mance and is often described as “performance,” including in four papers in the appendix, here, innovation is treated 
as part of the wider concept of organizational effectiveness. 

3. Although we do not discuss it, the existence of organized and well-structured databases is another driver as 
to the prevalence of the use of specific financial performance measures and the sorts of companies studied. For 
example, COMPUSTAT and CRSP have allowed for both the more consistent and faster publication of findings and 
more direct comparability. However, COMPUSTAT and similar databases invariably include a specific bias as to 
which measures are most relevant and which firms get studied most. 

4. Stern Stewart recommends 160 adjustments to overcome such distortions (Ittner & Larcker, 1998). 


References 


Ailawadi, K. L., Dant, R. P., & Grewal, D. 2004. The difference between perceptual and objective performance 
measures: An empirical analysis (Marketing Science Institute Working Paper No. 04-001). Cambridge, MA: 
Marketing Science Institute. 

Altman, E. 1968. Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. Journal of 
Finance, 23: 589-609. 

Arnold, V., Collier, P. A., Leech, S. A., & Sutton, S. G. 2000. The effect of experience and complexity on order and 
recency bias in decision-making by professional accountants. Accounting & Finance, 40: 109-134. 

Aupperle, K. E., Acar, W., & Booth, D. E. 1986. An empirical critique of In Search of Excellence: How excellent 
are the excellent companies? Journal of Management, 12: 499-512. 

Baker, G. P., & Kennedy, R. E. 2002. Survivorship and the economic grim reaper. Journal of Law, Economics and 
Organization, 18: 324-361. 

Barney, J. 1991. Firm resources and sustained competitive advantage. Journal of Management, 17: 99-120. 

Baum, J. R., & Wally, S. 2003. Strategic decision speed and firm performance. Strategic Management Journal, 24: 
1107-1130. 

Benston, G. J. 1985. The market for public accounting services: Demand, supply and regulation. Journal of 
Accounting and Public Policy, 4: 33-79. 

Biddle, G. C., Bowen, R. M., & Wallace, J. S. 1997. Does EVA beat earnings? Evidence on associations with stock 
returns and firm values. Journal of Accounting and Economics, 24: 301-336. 

Blattberg, C. 2000. From pluralist to patriotic politics: Putting practice first. Oxford, UK: Oxford University 
Press. 

Bond, S. R., & Cummins, J. G. 2000. The stock market and investment in the new economy: Some tangible facts 
and intangible fictions. Brookings Papers on Economic Activity, 1: 61-108. 

Boulding, W. 1990. Unobservable effects and business performance: Do fixed effects matter? Marketing Science, 
9: 88-91. 

Bowen, H., & Wiersema, M. 1999. Matching method to paradigm in strategy research: Limitations of cross- 
sectional analysis and some methodological alternatives. Strategic Management Journal, 20: 625-636. 

Boyd, B. K., Gove, S., & Hitt, M. A. 2005. Construct measurement in strategic management research: Illusion or 
reality? Strategic Management Journal, 26: 239-257. 

Bryant, L., Jones, D. A., & Widener, S. K. 2004. Managing value creation within the firm: An examination of mul- 
tiple performance measures. Journal of Management Accounting Research, 16: 107-132. 

Bulow, J. I, Geanakoplos, J. D., & Klemperer, P. D. 1985. Multimarket oligopoly: Strategic substitutes and comple- 
ments. Journal of Political Economy, 93: 488-511. 

Business schools and research: Practically irrelevant? 2007, August 28. Economist.com. Retreived from http://www. 
economist.com/business/globalexecutive/displaystory.cfm?story_id=E1_JSDSQJR, 


Downloaded from jom.sagepub.com at UNIV OF WISCONSIN on July 5, 2012 


800 Journal of Management / June 2009 


Callen, J. L. 1991. Data envelope analysis: Partial survey and applications for management accounting. Journal of 
Accounting Management Research, 3: 35-56. 

Cameron, K., & Whetten, D. 1983. Organizational effectiveness: One model or several? In K. Cameron & D. Whetten 
(Eds.), Organizational effectiveness: A comparison of multiple methods: 1-24. New York: Academic Press. 

Campbell, D. T., & Fiske, D. W. 1959. Convergent and discriminant validation by the multitrait-multimethod 
matrix. Psychological Bulletin, 56: 81-105. 

Campbell, W., & Sedikides, C. 1999. Self-threat magnifies the self-serving bias: A meta-analytic integration. 
Review of General Psychology, 3: 23-43. 

Capon, N., Farley, J. U., & Hoenig, S$. 1990. Determinants of financial performance: A meta-analysis. Management 
Science, 36: 1143-1159. 

Casciaro, T., & Piskorski, M. J. 2005. Power imbalance, mutual dependence, and constraint absorption: A closer 
look at resource dependence theory. Administrative Science Quarterly, 50: 167-199. 

Chakravarthy, B. S. 1986. Measuring strategic performance. Strategic Management Journal, 7: 437-458. 

Chan, L.K.C., Megadeath, N., & Lakonishok, J. 1996. Momentum strategies. Journal of Finance, 51: 1681-1713. 

Chaney, P., & Philipich, K. 2002. Shredded reputation: The cost of audit failure. Journal of Accounting Research, 
40: 1221-1245. 

Charnes, A., Cooper, W. W., & Rhodes, E. 1978. Measuring the efficiency of decision making units. European 
Journal of Operations Research, 2: 429-444. 

Chen, S., & Dodd, J. L. 1997. Economic value added (EVA): An empirical examination of a new corporate perfor- 
mance measure. Journal of Managerial Issues, 9: 319-333. 

Chen, S., & Dodd, J. L. 2001. Operating income, residual income and EVA: Which metric is more value relevant? 
Journal of Managerial Issues, 13: 65-86. 

Chenhall, R. H., & Langfield-Smith, K. 2007. Multiple perspectives of performance measures. European 
Management Journal, 25: 266-282. 

Cho, H. J., & Pucik, V. 2005. Relationship between innovativeness, quality, growth, profitability and market value. 
Strategic Management Journal, 26: 555-576. 

Clarkson, M.B.E. 1995. A stakeholder framework for analyzing and evaluating corporate social performance. 
Academy of Management Review, 20: 92-117. 

Collins, J. C., & Porras, J. 1. 1994. Built to last: Successful habits of visionary companies. New York: Harper 
Business. 

Contractor, F. J., Kundu, S. K., & Hsu, C. C. 2003. A three-stage theory of international expansion: The link 
between multinationality and performance in the service sector. Journal of International Business Studies, 34: 
5-18. 

Cook, W. D., & Bala, K. 2007. Performance measurement and classification data in DEA: Input-oriented model. 
Omega, 35: 39-52. 

Cutler, D. M., Poterba, J. M., & Summers, L. H. 1989. What moves stock prices? Journal of Portfolio Management, 
15: 4-12. 

Danielson, M., & Press, E. 2003. Accounting returns revisited: Evidence of their usefulness in estimating economic 
returns. Review of Accounting Studies, 8: 493-530. 

Davig, W., Elbert, N., & Brown, S. 2004. Implementing a strategic planning model for small manufacturing com- 
panies: An adaptation of the balanced scorecard. $.A.M. Advanced Management Journal, 69: 18-25. 

Davis, P. S., & Pett, T. L. 2002. Measuring organizational efficiency and effectiveness. Journal of Management 
Research, 2: 87-98. 

Denrell, J. 2004. Random walks and sustained competitive advantage. Management Science, 50: 922-934. 

Dess, G. G., & Robinson, R. B., Jr. 1984. Measuring organizational performance in the absence of objective mea- 
sures: The case of the privately-held firm and conglomerate business unit. Strategic Management Journal, 5: 
265-273. 

Devinney, T. M., Midgley, D. F., & Venaik, S. 2001. The organizational imperative and the optimal performance of 
the global firm. Organization Science, 11: 674-695. 

Devinney, T. M., & Milde, H. 1990. Agency contracting and inside debt. Die Schweizerische Zeitschrift fiir 
Volkswirtschaft und Statistik, 126, 97-111. 

Devinney, T. M., & Stewart, D. W. 1988. Rethinking the product portfolio: A generalized investment model. 
Management Science, 34: 1080-1095. 


Downloaded from jom.sagepub.com at UNIV OF WISCONSIN on July 5, 2012 


Richard et al. / Organizational Performance 801 


Devinney, T. M., Yip, G. S., & Johnson, G. 2009. Using frontier analysis to evaluate company performance. British 
Journal of Management, forthcoming. 

Dore, R. 2000. Stock market capitalism: Welfare capitalism. Japan and Germany versus Anglo-Saxons. New York: 
Oxford University Press. 

Dunne, T., Roberts, J. M., & Samuelson, L. 1988. Patterns of firm entry and exit in U.S. manufacturing industries. 
RAND Journal of Economics, 19: 495-515. 

Dutta, S., & Reichelstein, S. 2005. Stock price, earnings, and book value in managerial performance measures. The 
Accounting Review, 80: 1069-1100. 

Edvinsson, L., & Malone, M. S. 1997. Intellectual capital: Realizing your company’s true value by finding its hid- 
den brainpower. New York: Harper Business. 

Edwards, J. R., & Bagozzi, R. P. 2000. On the nature and direction of relationships between constructs and mea- 
sures. Psychological Methods, 5: 155-174. 

Feroz, E. H., Kim, S., & Raab, R. L. 2003. Financial statement analysis: A data envelopment approach. Journal of 
the Operational Research Society, 44: 48-58. 

Fisher, F. M., & McGowan, J. J. 1983. On the misuse of accounting rates of return to infer monopoly profits. 
American Economic Review, 73: 82-97. 

Fitzgerald, L., & Storbeck, J. 2003. Pluralistic views of performance. Management Decision, 41: 741-750. 

Fombrun, C., & Shanley, M. 1990. What’s in a name? Reputation building and corporate strategy. Academy of 
Management Journal, 33: 233-258. 

Foster, R., & Kaplan, S. 2001. Creative destruction: Why companies that are built to last underperform the 
market—and how to transform them. New York: Currency/Doubleday. 

Freeman, R. E. 1984. Strategic management: A stakeholder approach. Boston: Pitman. 

Fryxell, G. E., & Barton, S. L. 1990. Temporal and contextual change in the measurement structure of financial 
performance: Implications for strategy research. Journal of Management, 16: 553-569. 

Gilovich, T., Griffin, D., & Kahneman, D. (Eds.). 2002. Heuristics and biases: The psychology of intuitive judg- 
ment. New York: Cambridge University Press. 

Goerzen, A., & Beamish, P. W. 2003. Geographic scope and multinational enterprise performance. Strategic 
Management Journal, 24: 1289-1306. 

Graham, J. R. 1999. Herding among investment newsletters: Theory and evidence. Journal of Finance, 59: 237-269. 

Grinblatt, M., Titman, S., & Wermers, R. 1995. Momentum investment strategies, portfolio performance, and herd- 
ing: A study of mutual fund behavior. American Economic Review, 85: 1088-1105. 

Gupta, A. K., & Govindarajan, V. 1986. Resource sharing among SBUs: Strategic antecedents and administrative 
implications. Academy of Management Journal, 29: 695-714. 

Guthrie, J. 2001. High-involvement work practices, turnover, and productivity: Evidence from New Zealand. 
Academy of Management Journal, 44: 180-190. 

Hall, B., Jaffe, A., & Trajtenberg, M. 2005. Market value and patent citations. Rand Journal of Economics, 36: 
16-38. 

Hannan, M. T., & Freeman, J. 1977. The population ecology of organizations. American Journal of Sociology, 82: 
929-964. 

Hawawini, G., Subramanian, V., & Verdin, P. 2003. Is performance driven by industry—or firm-specific factors? A 
new look at the evidence. Strategic Management Journal, 24: 1-16. 

Henri, J. F. 2004. Performance measurement and organizational effectiveness: Bridging the gap. Managerial 
Finance, 30: 93-123. 

Hillman, A. J., & Keim, G. D. 2001. Shareholder value, stakeholder management, and social issues. What’s the 
bottom line? Strategic Management Journal, 22: 125-140. 

Hofer, C. W. 1983. ROVA: A new measure for assessing organizational performance. In R. Lamb (Ed.), Advances 
in strategic management: Vol. 2, 43-55. New York: JAI Press. 

Ittner, C. D., & Larcker, D. F. 1998. Innovations in performance measurement: Trends and research implications. 
Journal of Management Accounting Research, 10: 205-238. 

Jacobson, R. 1987. The validity of ROI as a measure of business performance. American Economic Review, 77: 
470-478. 

Jacobson, R. 1988. The persistence of abnormal returns. Strategic Management Journal, 9: 415-430. 


Downloaded from jom.sagepub.com at UNIV OF WISCONSIN on July 5, 2012 


802 Journal of Management / June 2009 


Jarrell, G. A., & Poulsen, A. B. 1987. Shark repellents and stock prices: The effects of anti-takeover amendments 
since 1980. Journal of Financial Economics, 19: 127-168. 

Jensen, M. C., & Meckling, W. H. 1976. Theory of the firm: Managerial behavior, agency costs and ownership 
structure. Journal of Financial Economics, 3: 305-360. 

Jusoh, R., & Parnell, J. A. 2008. Competitive strategy and performance measurement in the Malaysian context: An 
exploratory study. Management Decision, 46: 5-31. 

Kahneman, D., & Riepe, M. W. 1998. Aspects of investor psychology. Journal of Portfolio Management, 24: 
52-65. 

Kahneman, D., & Tversky, A. 2000. Choices, values and frames. New York: Cambridge University Press. 

Kaplan, R. S., & Norton, D. P. 1996. The balanced scorecard: Translating strategy into action. Boston: Harvard 
Business School Press. 

Keats, B. W. 1988. The vertical construct validity of business economic performance measures. Journal of Applied 
Behavioral Science, 24: 151-160. 

Kirby, J. 2005. Toward a theory of high performance. Harvard Business Review, 83: 30-39. 

Laitinen, E. K., & Chong, G. 2006. How do small companies measure their performance? Problems and Perspectives 
in Management, 4: 49-68. 

Lehn, K., & Makhija, A. K. 1997. EVA, accounting profits, and CEO turnover: An empirical examination 1985- 
1994. Journal of Applied Corporate Finance, 10: 90-97. 

Lenz, R. T. 1981. Determinants of organizational performance: An interdisciplinary review. Strategic Management 
Journal, 2: 131-154. 

Lev, B. 2001. Intangibles: Management, measurement and reporting. Washington, DC: Brookings Institution. 

Levenson, A. R., Van der Stede, W. A., & Cohen, S. G. 2006. Measuring the relationship between managerial com- 
petencies and performance. Journal of Management, 32: 360-380. 

Luu, T., Kim, S., Cao, H., & Park, Y. 2008. Performance measurement of construction firms in developing countries. 
Construction Management and Economics, 26: 373-386. 

Malina, M. A., & Selto, F. H. 2004. Choice and change of measures in performance measurement models. 
Management Accounting Research, 15: 441-469. 

Malkiel, B. G. 2003. The efficient market hypothesis and its critics. Journal of Economic Perspectives, 17: 59-82. 

March, J. G., & Sutton, R. I. 1997. Organizational performance as a dependent variable. Organization Science, 8: 
698-706. 

Margolis, J. D., Elfenbein, H. A., & Walsh, J. P. 2007. Does it pay to be good? A meta-analysis and redirection of research 
on the relationship between corporate social and financial performance. Unpublished working paper. 

McGahan, A. M. 2004. How industries evolve: Principles for achieving and sustaining superior performance. 
Boston: Harvard Business School Press. 

McGahan, A. M., & Porter, M. E. 1997. How much does industry matter, really? Strategic Management Journal, 
18: 15-30. 

McGahan, A. M., & Porter, M. E. 1999. The persistence of shocks to profitability. Review of Economics and 
Statistics, 81: 143-153. 

McGahan, A. M., & Porter, M. E. 2003. The emergence and sustainability of abnormal profits. Strategic 
Organization, 1: 79-108. 

McGuire, J. B., Schneeweis, T., & Branch, B. 1990. Perceptions of firm quality: A cause or result of firm perfor- 
mance. Journal of Management, 16: 167-180. 

Mezias, J. M., & Starbuck, W. H. 2003. Studying the accuracy of managers’ perceptions: A research odyssey. British 
Journal of Management, 14: 3-17. 

Miles, R., & Snow, C. C. 1978. Organizational strategy, structure, and process. New York: McGraw-Hill. 

Miller, D. J. 2004. Firms’ technological resources and the performance effects of diversification: A longitudinal 
study. Strategic Management Journal, 25: 1097-1120. 

Milunovich, S., & Tseui, A. 1996. EVA in the computer industry. Journal of Applied Corporate Finance, 9: 104-115. 

Mitchell, R. K., Agle, B. R., & Wood, D. J. 1997. Toward a theory of stakeholder identification and salience: 
Defining the principle of who and what really counts. Academy of Management Review, 22: 853-886. 

Nadkarni, S., & Narayanan, V. K. 2007. Strategic schemas, strategic flexibility, and firm performance: The moder- 
ating role of industry clock speed. Strategic Management Journal, 28: 243-270. 


Downloaded from jom.sagepub.com at UNIV OF WISCONSIN on July 5, 2012 


Richard et al. / Organizational Performance 803 


Neely, A., & Bourne, M. 2000. Why measurement initiatives fail. Measuring Business Excellence, 4: 3-6. 

Orlitzky, M., Schmidt, F. L., & Rynes, S. L. 2003. Corporate social and financial performance: A meta-analysis. 
Organization Studies, 24: 403-441. 

Peng, M. W. 2004. Outside directors and firm performance during institutional transitions. Strategic Management 
Journal, 25: 453-472. 

Perfect, S. B., & Wiles, K. W. 1994. Alternative constructions of Tobin’s g: An empirical comparison. Journal of 
Empirical Finance, 1: 313-341. 

Peters, T. J., & Waterman, R. H. 1982. In search of excellence: Lessons from America’s best run companies. New 
York: Harper & Row. 

Powell, T. 2003. Varieties of competitive parity. Strategic Management Journal, 24: 61-86. 

Ramanujam, V., & Venkatraman, N. 1988. Excellence, planning and performance. /nterfaces, 18: 23-31. 

Rappaport, A. 1978. Executive incentives vs. corporate growth. Harvard Business Review, 56: 81-88. 

Rappaport, A. 1986. Creating shareholder value: The new standard for business performance. New York: Free Press. 

Roberts, P. W., & Dowling, G. R. 2002. Corporate reputation and sustained superior performance. Strategic 
Management Journal, 23: 1077-1094. 

Robinson, R. A. 2000. Tracking stocks. Computerworld, February 21: 52. 

Roll, R. 1988. R?. Journal of Finance, 43: 541-566. 

Rosenzweig, P. 2007. Misunderstanding the nature of company performance: The halo effect and other business 
delusions. California Management Review, 49: 6-20. 

Rowe, W. G., Cannella, A. A., Jr., Harris, I. C., & Francolini, T. 2003. In search of meaning: Does the Fortune 
reputation survey alter performance expectations? Canadian Journal of Administrative Sciences, 20: 187-195. 

Rowe, W. G., & Morrow, J. L., Jr. 1999. A note on the dimensionality of the firm financial performance construct using 
accounting, market, and subjective measures. Revue Canadienne des Sciences de l’Administration, 16: 58-70. 

Rumelt, R. P. 1991. How much does industry matter? Strategic Management Journal, 12: 167-185. 

Schneiderman, A. M. 1999. Why balanced scorecards fail. Journal of Strategic Performance Measurement, 3: 
6-11. 

Schriesheim, C. A., Powers, K. J., Scandura, T. A., Gardiner, C. C., & Lankau, M. J. 1993. Improving construct 
measurement in management research: Comments and a quantitative approach for assessing the theoretical 
content adequacy of paper-and-pencil survey type instruments. Journal of Management, 19: 385-417. 

Shiller, R. L. 1989. Market volatility. Cambridge, MA: MIT Press. 

Spanos, Y. E., Zaralis, G., & Lioukas, S$. 2004. Strategy and industry effects on profitability: Evidence from Greece. 
Strategic Management Journal, 25: 139-166. 

Stajkovic, A. D., & Sommer, S. 2000. Self-efficacy and causal attributions: Direct and reciprocal links. Journal of 
Applied Social Psychology, 30: 707-737. 

Steers, R. M. 1975. Problems in the measurement of organizational effectiveness. Administrative Science Quarterly, 
20: 546-558. 

Stern, J. M., Stewart, G. B., III, & Chew, D. R., Jr. 1995. The EVA financial management system. Journal of Applied 
Corporate Finance, 8: 32-46. 

Sveiby, K. E. 1997. The new organizational wealth: Managing and measuring knowledge based assets. San 
Francisco: Berrett Koehler. 

Taylor, S., & Brown, J. 1988. Illusion and well-being: A social psychological perspective on mental health. 
Psychological Bulletin, 103: 193-210. 

Thanassoulis, E. 1993. A comparison of regression analysis and data envelopment approaches as alternative meth- 
ods of performance assessment. Journal of the Operational Research Society, 44: 1129-1144. 

Tobin, J. 1969. A general equilibrium approach to monetary theory. Journal of Money, Credit and Banking, 1: 15-29. 

Tversky, A., & Kahneman, D. 1973. Availability: A heuristic for judging frequency and probability. Cognitive 
Psychology, 5: 207-232. 

United Nations. 2007. UN Global Compact: Annual review. New York: United Nations Global Compact Office. 

Van der Laan, G., Van Ees, H., & Van Witteloostuijn, A. 2008. Corporate social and financial performance: An 
extended stakeholder theory, and empirical test with accounting measures. Journal of Business Ethics, 79: 
299-310. 


Downloaded from jom.sagepub.com at UNIV OF WISCONSIN on July 5, 2012 


804 Journal of Management / June 2009 


Varadarajan, P. R., & Ramanujam, V. 1990. The corporate performance conundrum: A synthesis of contemporary 
views and an extension. Journal of Management Studies, 27: 463-483. 

Varaiya, N., Kerin, R. A., & Weeks, D. 1987. The relationship between growth, profitability, and firm value. 
Strategic Management Journal, 8: 487-497. 

Venkatraman, N., & Ramanujam, V. 1986. Measurement of business performance in strategy research: A compari- 
son of approaches. Academy of Management Review, 11: 801-814. 

Venkatraman, N., & Ramanujam, V. 1987. Measurement of business economic performance: An examination of 
method convergence. Journal of Management, 13: 109-122. 

Wall, T., Michie, J., Patterson, M., Wood, S., Sheehan, M., Clegg, C., & West, M. 2004. On the validity of subjective 
measures of company performance. Personnel Psychology, 57: 95-118. 

Wallace, J. S. 1997. Adopting residual income-based compensation plans: Do you get what you pay for? Journal of 
Accounting and Economics, 23: 275-300. 

Waring, G. F. 1996. Industry difference in the persistence of firm-specific returns. American Economic Review, 86: 
1253-1265. 

Who’s excellent now? 1984. Business Week, November 5: 76-86. 

Winter, S. G. 2003. Mistaken perceptions: Cases and consequences. British Journal of Management, 14: 39-44. 

Yip, G. S., Devinney, T. M., & Johnson, G. 2008, September 19. Measuring long term superior performance: The 
UK’s long-term superior performers 1984-2003 (SSRN Working Paper). Retrieved from http://ssrn.com/ 
abstract=1270438 


Downloaded from jom.sagepub.com at UNIV OF WISCONSIN on July 5, 2012 


